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Abstract 

This paper concerns a specific class of strict standard episturmian words whose directive words 
resemble those of characteristic Sturmian words. In particular, we explicitly determine all integer 
powers occurring in such infinite words, extending recent results of Damanik and Lenz (2003), who 
studied powers in Sturmian words. The key tools in our analysis are canonical decompositions and 
a generalization of singular words, which were originally defined for the ubiquitous Fibonacci word. 
Our main results are demonstrated via some examples, including the k-honacci word, a generalization 
of the Fibonacci word to a fc- letter alphabet (fc > 2). 
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1 Introduction 

Introduced by Droubay, Justin and Pirillo [5] , episturmian words are an interesting natural generalization 
of the well-known family of Sturmian words (aperiodic infinite words of minimal complexity) to an 
arbitrary finite alphabet. Episturmian words share many properties with Sturmian words and include 
the well-known Arnoux-Rauzy sequences, the study of which began in [1] (also see [16| I21j for example). 

In this paper, the study of episturmian words is continued in more detail. In particular, for a specific 
class of episturmian words (a typical element of which we shall denote by s), we will explicitly determine 
all the integer powers occurring in its constituents. This has recently been done in [B] for Sturmian words, 
which are exactly the aperiodic episturmian words over a 2-letter alphabet. 

A finite word w has a (non-trivial) integer power in an infinite word yiii — ww ■ ■ ■ w {p times) is a 
factor of X for some integer p > 2. Here, our analysis of powers occurring in episturmian words s hinges 
on canonical decompositions in terms of their 'building blocks'. Another key tool is a generalization of 
singular words, which were first defined in [23j for the ubiquitous Fibonacci word, and later extended to 
Sturmian words in 19J and the Tribonacci sequence in |22| . Our generalized singular words will prove to 
be useful in the study of factors of episturmian words, just as they have been for Sturmian words. 

This paper is organized as follows. After some preliminaries (Section [2]), we define, in Section [31 
a restricted class of episturmian words upon which we will focus for the rest of the paper. A typical 
element of this class will be denoted by s. In Section 21 we prove some simple results, which lead us to 
a generalization of singular words for episturmian words s. The index, i.e., maximal fractional power, 
of the building blocks of s is then studied in Section [5l Finally, in Section [6l we determine all squares 
(and subsequently higher powers) occurring in s. Our main results are demonstrated via some examples, 
including the k-bonacci word, a generalization of the Fibonacci word to a fc-letter alphabet (fc > 2). 

*E-mail: amy.glen@gmail.com 



2 Definitions and notations 



2.1 Words 

Let A denote a finite alphabet. A (finite) word is an element of the free monoid A* generated by A, 
in the sense of concatenation. The identity e of A* is called the empty word, and the free semigroup, 
denoted by A'^, is defined by A'^ := A* \ {s}. An infinite word (or simply sequence) x is a sequence 
indexed by N with values in A, i.e., x — xoXiX2 ■ ■ ■ , where each Xi € A. The set of all infinite words over 
A is denoted by A'^ , and we define A°^ := A* U A'^ . If u is a non-empty finite word, then u'^ denotes the 
purely periodic infinite word uuu ■ ■ ■ . 

li w — X1X2 ■ ■ ■ Xm G A'^ , each Xi G A, the length of w is \w\ — m and we denote by \w\a the number 
of occurrences of a letter a in w. (Note that \e\ — 0.) The reversal of w is w — XmXm-i ■ ■ '^i, and if 
w = w, then w is called a palindrome. 

A finite word is a factor of z G A°° if z = uwv for some u e A*, v € ,4°°, and we write w ~< z. 
Further, w is called a prefix (resp. suffix) of z if u = e (resp. v — e), and we write w ~<p z (resp. w -<s z). 

An infinite word x G A^ is called a suffix of z G A^ if there exists a word w G A^ such that z = uix. 
A factor w of a word z G is ri(7/it (resp. left) special if wa, wb (resp. aw, bw) are factors of z for some 
letters a, b E A, a ^ b. 

For any word w G -4°°, ri(w) denotes the set of all its factors, and fln{w) denotes the set of all factors 
of w of length n G N, i.e., il„(w) :— ri(w) n (where \w\ > n for w finite). Moreover, the alphabet of w 
is Alph(w) :— fl{w) HA and, if w is infinite, we denote by Ult(?x;) the set of all letters occurring infinitely 
often in w. Two infinite words x, y G A'^ are said to be equivalent if Q{y) = ^2(x), i.e., if x and y have 
the same set of factors. 

Let w — X1X2 ■ ■ ■ Xm G A* , each Xi G A, and let j G N with < j < m — 1. The j-th conjugate of w 
is the word Cj(w) :— Xjj^iXj+2 ■ ■ ■ XmXiX2 ■ ■ -Xj, and we denote by C(w) the conjugacy class of w, i.e., 
C{w) := {Cj(w) : < j < jwl — 1}. Observe that if w is primitive (i.e., not a power of a shorter word), 
then w has exactly lui] distinct conjugates. 

The inverse of w G A*, written w~^, is defined by ww^^ — w^^w = e. It must be emphasized that 
this is merely formal notation, i.e., for u,v,w G -4*, the words u^^w and wv^^ are defined only if u 
(resp. v) is a prefix (resp. sufhx) of w. 

A morphism on ^ is a map tp : A* A* such that ipluv) = ^{u)'ip{v) for all u,v E A*. It is uniquely 
determined by its image on the alphabet A. 



2.2 Episturmian words 

An infinite word t G A'^ is episturmian if f2(t) is closed under reversal and t has at most one right (or 
equivalently left) special factor of each length. Moreover, an episturmian word is standard if all of its left 
special factors are prefixes of it. 

Standard episturmian words are characterized in [I8| using the concept of the palindromic right- closure 
w'^^^ of a finite word w, which is the (unique) shortest palindrome having w as a prefix (see [7]). Specif- 
ically, an infinite word t G A'^ is standard episturmian if and only if there exists an infinite word 
A(t) = X1X2X3 . . ., each Xi G A, called the directive word of t, such that the infinite sequence of palin- 
dromic prefixes ui = e, M2, U3, . . . of t (which exists by results in 8J) is given by 

M„+i = (u„x„)W, nGN+. (2.1) 

Note. For any w G A'^ , ic^^^^ — wv^^w where v is the longest palindromic suffix of w. 

An important point is that a standard episturmian word t can be constructed as a limit of an infinite 
sequence of its palindromic prefixes, i.e., t = lim„^oo u„. 

Let a G .4 and denote by the morphism on A defined by 

a 1-^ a 

X i-^ ax for all x G ^ \ {a}. 
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Another useful characterization of standard episturmian words is the following (see |15|). An infinite word 
t £ A'^ is standard episturmian with directive word A(t) — X1X2X3 ■ ■ ■ {xi G A) if and only if there exists 
an infinite sequence of infinite words t^o) = t, t(i), t^^), . . . such that t^^-^) = ^'^^.(t^*)) for all i G N+. 
Moreover, each t*-*' is a standard episturmian word with directive word A(t*^*') = Xi^iXi^2Xi+3 • • ■ , the 
i-th shift of A{t). 

To the prefixes of the directive word A(t) = X1X2 • • • , we associate the morphisms 
and define the words 

which are clearly prefixes of t. We have the following useful formula [TS] 
and whence, for n > 1 and < p < n, 

Un = hn-2hn-3 ' ' ' /ll^O = ^n-2^n-3 ' ' ' ^p-lWp. (2-2) 

Some useful properties of the words h„ and u„ are given by the following lemma. 
Lemma 2.1. [11] For all n G 

(i) hn is a primitive word; 

(ii) h„ = /i„_i if and only if Xn+i = x„\ 

(iii) if Xn+i 7^ Xn, then u„ is a proper prefix of hn- □ 

Two functions can be defined with regard to positions of letters in a given directive word. For 
n G N+j let P{n) — sup{p < n : Xp = x„} if this integer exists, P{n) undefined otherwise. Also, let 
S{n) — inf{p > n : a;^ = a;„} if this integer exists, S{n) undefined otherwise. By the definitions of 
palindromic closure and the words it„, it follows that m„+i — u„a;„M„ (whence hn-i — if Xn does 

not occur in u„, and Un+i = UnUp^^^-^Un (whence /i„_iup(„) = Un) if Xn occurs in u„. Thus, if P{n) 
exists, then 

hn-l — hn-2hn~3 ■ ■ ■ hp(^n)-l7 n>l. (2.3) 

2.2.1 Strict episturmian words 

A standard episturmian word t G A'^ , or any equivalent (episturmian) word, is said to be B -strict (or 
k-strict if \B\ = k, or strict if B is understood) if Alph(A(t)) — Ult(A(t)) = B C A. In particular, 
a standard episturmian word over A is .A-strict if every letter in A occurs infinitely many times in its 
directive word. The /c-strict episturmian words have complexity (fc — l)n+l for each n G N (i.e., {k — l)n+l 
distinct factors of length n for each n G N). Such words are exactly the fc- letter Arnoux-Rauzy sequences. 

2.2.2 Return words 

Let X G A^ be recurrent^ i.e., any factor w of x occurs infinitely often in x. A return word of w G r2(x) is 
a factor of x that begins at an occurrence of w in x and ends exactly before the next occurrence of w in 
X. Thus, a return word of w is a non-empty factor w of x such that w is a prefix of uw and uw contains 
two distinct occurrences of w. This notion was introduced independently by Durand "9], and Holton and 
Zamboni [13]. 

Episturmian words are recurrent and, according to 17, Corollary 4.5], each factor of an ^-strict 
episturmian word has exactly \A\ return words. 
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3 A class of strict standard episturmian words 



Given any infinite sequence A = xiX2X^ ■ ■ ■ over a finite alpfiabet we can define a standard episturmian 
word having A as its directive word (using (|2.ip ). In this paper, however, we shall only consider a specific 
family of ^-strict standard episturmian words. 

Let Ak denote a fc-letter alphabet, say Ak = {ai, 02, ... , afe}, and suppose t is a standard episturmian 
word over Ak- Then the directive word of t can be expressed as: 

where the di are non-negative integers. In what follows, we restrict our attention to the case when all 
di > 0; that is, we shall only study the class of fc-strict standard episturmian words s G A^ with directive 
words of the form: 

1^ = 4^4^ ■■■al^a'i''^'4''^^ ■■■ , > 0. (3.1) 

This definition of s will be kept throughout the rest of this paper. 

Let us define a sequence (sn)n>i-k of words associated with s as follows: 

si-fc = 02, S2-i; = 03, ... , s_i=afe, so=ai, 

s« = 4^i4-"2' •••So'«n+i' l<n<k-l, (3.2) 

dj^ dn-l C^Ti-;c + 2 \ jL 

Clearly, s„ is a prefix of s„+i for all n > (and hence (|s„|)„>o is a strictly increasing sequence of positive 
integers). 

Example 3.1. It is well-known that the standard (or characteristic) Sturmian word Cq, of irrational 
slope a = [0; 1 + di, c?2, (^3, • ■ di > 1, (see [3] for a definition) is the standard episturmian word over 
A = {a, b} with directive word A{ca) = a"^^ b'^'^ a"^^ b'^'^ a'^'^ ■ ■ ■ . We have = hm„^oo where (s„)„>_i 
is the standard sequence associated with c^, defined by 

s_i = fe. So = a, s„ = sJ^'LiSn-2, n>l. 

This coincides with our definition (|3.2p above. Observe that, for all n > 0, |s„| = g„, where qn is the 
denominator of the n-th convergent to [0; 1 -f di, (i2, ^3, . . .]. 

For all m > 1, let Lm :^ di + d2 + ■ ■ ■ + dm- Then, writing A(cq,) = X1X2X3 ■ ■ ■ with each Xi G A, we 
have Xn+i ^ a;„ if and only if n is equal to some L,„. One easily deduces that S{L„i) = L„i+i + 1 and 
P{Lm+i -l- 1) = Lm, and it can also be shown that the /il^ satisfy the same recurrence relation as the 
qm- Hence, |/il,„| — qm, and clearly we have = Sm (see Proposition 13.11 below) . □ 

Notice that s has directive word resembling A(cq). 

Notation. Hereafter, let L„ :— di + (i2 + • • • + dn for each n > 1- 

Proposition 3.1. For any n> 1, s„ = /i^^. Moreover, s = lim s„. 

Proof. The directive word of s is given by 

Adi dn du t^fc + l dk-i-2 dok f^2fc + l ^ A 

— a-t^' a2 ■ ■ ■ a-^ ^ a2 ■■■a^.^^a-^ ^ ■ ■ ■ = xiX2XzXi- ■ ■ , Xi £ Ak- 

For n > 1, we have Xn+i 7^ Xn (and hence /i„ 7^ ^n-i) if and only if n is equal to some Lm- In 
particular, for any m > 1, 

= /iL™+i-r, 1 < r < dm+i- (3.3) 
Furthermore, it is clear that, for all n>k, 

PiLn + 1) = Ln-k+l, (3.4) 
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and P{Ln + 1) is undefined for 1 < n < A: — 1. 

First we sliow tlrat s„ = hi,^ for 1 < n < fc. Observe tfiat, for 1 < n < fc — 1, 

= <;*a^ ), 

Similarly, since = ^tl'^tl ■ ■ ■ *f^(ai), one finds tliat 

Hl^ = h'i" /if""' ■■■hfai. 

Thus, we see that the Sn satisfy the same recurrence relation as the for \ < n < k. Therefore, since 
^0 = A*o(ai) — ai — sq, we have 

Sn — hL„ for all n, 1 < n < k. (3-5) 
Now take n> k+l. Then, by ([^ and (P^ . we have 

^iri — ^-L„-l^-L„-2 • ■ ■ /iL„_fc+i^L„_fc + i-l, 

and therefore it follows from (|3.3p that 

/^i. = ^t-.fc ■ • • /^t:r+>i-.- (3-6) 

Whence, since s„ = for 1 < n < fc, (13. 6p shows that the s„ satisfy the same recurrence relation as 
the for n > k + 1. Thus, by virtue of this fact and (|3.5p . we have 



Sn = ^L„ for all n > 1, 

as required. 

The second assertion follows immediately from the first since s — lim hm |15| . □ 

m — ^oo 

Accordingly, the words (s„)„>i can be viewed as 'building blocks' of s. 

Example 3.2. The Tribonacci sequence (or Rauzy word |20| ) is the standard episturmian word over 
{a, b, c} directed by (abc)'^ . Since all di — 1, we have L„ — n, and hence hn = = 'S„_is„_2S»i-3 for all 
n > 1. □ 

3.1 Two special integer sequences 

Set Qn ■— \sn\ for all n > 0. Then the integer sequence {Qn)n>Q is given by: 

Qo 1, Qn = dnQn-l + dn-lQn-2 ^ \- diQa + I, I < U < k - 1, 

Qn = dnQn-1 + dn-lQn-2 + ' ' ■ + C?n+2-feQn+l-fe + Qn-k, n > k. 

Now, define the integer sequence {Pn)n>o by: 

Po = 0, Pn = dnPn-l + dn-lPn-2 + ' ' ' + rflPo + 1, 1 < 71 < fc - 1, 
Pn = dnPn-1 + dn-lPn-2 + ' ' ' + dn+2-kPn+l-k + Pn-k, 71 > fc. 

For fc = 2, observe that Pn/Qn is the n-th convergent to the continued fraction expansion [0; 1 + 

^1,^2,^3,^4, • ■ •]• 

Proposition 3.2. For all n > 0, |s„|q^ — Qn — Pn- 

Proof. Proof is by induction on n. □ 
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4 Generalized singular words 

Recall the standard Sturmian word of slope a — [0; 1 + di, (i2, da, . . .], di > 1. Melangon [19^ (also see 
[3]) introduced the singular words (w„)„>i of Cq, defined by 



w„ 



asnb ^ if n is odd, 
bsna~^ if n is even, 



with the convention W-2 — £, w_i = a, wq ^ b. It is easy to show that the set of factors of Cq of length 
|s„| is given by 

^|s„|(Ca) = C(s„) U {Wn}. 

(See [ini m [12] for instance.) Also note that in this 2-letter case s„ — UL„ab (resp. s„ = u^^ba) if n is 
odd (resp. even). 

Singular words are profoundly useful in studying properties of factors of (e.g., [ll lT^fTTlfTSllTOll^ ). 
It is for this very reason that we now generalize these words for the standard episturmian word s. Firstly, 
however, we prove some basic results concerning the words s„ and as detailed in the next section. 

4.1 Useful results 

For each n > 0, set Z?„ := u^^^^j. Observe that, for any m > 1, 

m — 1 

\Dm\ = {dm+1 - l)\Syn\ + ^ dj + i\sj\. (4.1) 

Indeed, using (|2.2p and (|3.3[) . one finds that 

_ <i„.+i-l d„ „'^".-i . . . „'i2 di / . 2^ 

Also note that Z?o = af^~^ since Z?o — ""di = ^di-2/idi-3 • ■ - hiho — h^^^^ . For technical reasons, we 
shall set ZJ-j := a^+i^j and |-D-j| = — 1 for 1 < j < A:. 

Proposition 4.1. Let 1 < z < A;. For all n>l~ k, a.i is the last letter of Sn if n = i — 1 (mod k). 

Proof. Since we have si-k — 02, S2-fc =03, • ■ • , S-i — o-k, sq = ai, the result follows immediately from 
the definition of the words s„ (see (|3.2p ). □ 

Proposition 4.2. For all n>0, Sn+iDn-k+i = SnDn, and hence \Dn\ - \Dn-k+i\ = |sn+i| - \sn\- 



Proof. The claim holds for < n < A; — 2 since Sn+iDn-k+i = Sn"^^ ■ ■ ■ So^ari+2a„+2 ~ and for 



Proposition 4.3. For all n>l, |s„| > jDn-ij. 

Proof. Proof is by induction on n. The result is clearly true for n = I since |si| = |a2^a2| = 11)001021 = 
I Do I + 2. Now assume the result holds for some n > 2. Then, using Proposition 14.21 

= |S„| + lAj - \Dn^k+l\ > \Dn-l\ + lAj - lAi-fc+ll > \Dn\, 

since |Ai-fe+i| < □ 



Recall that the words Dn and s„ are prefixes of s for all n G N. Thus, according to Proposition 14. 3| 
the palindromes Dq, Di, . . . , Dn-i are prefixes of s„. In fact, the maximal index i such that Di is a 
proper prefix of s„ is i = n — I, which is evident from the following result. 
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Proposition 4.4. For all n>0, _D„ = Sr!^*^ Dn_k. 

Proof. Firstly, Dq = af^^^ = Sq^q^^ = s^^D^k and, for 1 < n < fc - 1, we have 

f) _ d„ + i-l d„ . . . 

= 4"+'"'s„a,;|i (using (ISJ)) 

~ "n+1 ~ *n ^n-k- 

Now take n > k. Then 

n _ „<i.. + i-l„d„ _ <i„_fc+2 n _ d„ + i-l 7-, — s'^-'+i D , 



Proposition 4.5. For aZZ n > 0, s„ = Dn-kSnD^^j^. 



□ 



Proof. Proof is by induction on n. For n = 0, D ^^saD _\ = ^aioi = ai = So- Assume the result holds 
for some n > 1. Then, using Proposition 14. 2i 



Sn+l — SnDnD^_f.^^ — Dn-kSnD^_f,DnD^_f.^^. 

Therefore, invoking Proposition 14.41 and (j3.2p . for 1 < n < /c — 2, we have 

Sn+l = Dn-kSniSn)'^"'^^ Dn-k+1 

= Dn-k+ian+2a:^,lian+l{so)'^' ■ ■ ■ {Sn-lf"{Snf"^'D:;;^^_^_-^ 

= -Dn-fe+ian+2a„+2*n+l^n-fc+l 
= Dn-k+lSn+lD^^^.^-^. 

And, for n > fc — 1, 

-^n+l -^n — k^ni^n^ ^ ^ n — k^l 

= I?„_fe[?„_fe(s„_fc+i)'^"-'=+^-lS'„_fc+i(?„_fc+2)'^"-'= + ^ • • • {Sn-lY-]{Snf- + ^D-\^^ 
= Dn-k+lSn+lD^^\^^, 

as required. □ 

Remark 4.1. This result shows, in particular, that = D^^^^SnDn-k, i-e., s„ is the |_D„_fc|-th conjugate 
of s„ for each n > k. (For < n < k — 1, Sn is the (|s„| — l)-st conjugate of s„ since Sn = a„+is„a^j:]^.) 
The following two corollaries are direct results of the above proposition. 

Corollary 4.6. For any n > 0, the word ^nD^^^ is a palindrome. In particular, let Un = Dn-k md 
Vn = ^riDn-k- Then Sn = UnVn is the unique factorization of s„ as a product of two palindromes. 

Proof. From Proposition 14.51 we have s„ — Dn^k^nDn\ ~ UnVn, and whence D^^j^Sn = 'SnDn-k- 
therefore clear that ^nD~^^. is a palindrome. The uniqueness of the factorization s„ = UnVn is immediate 
from the primitivity of s„, which follows from Lemma [2TlT i). together with Proposition [3T] (Recall that 
since s„ is primitive, there are exactly \sn\ different conjugates of s„.) □ 

Corollary 4.7. For all n>0, s„ — Dn^nDn^ ■ 

Proof. Propositions 14.41 and 14.51 □ 
Notation. Now, for each n E N, we define the words Gn,r by 

Sn — Dn—rGn^n 1 <J T 1. 
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Example 4.1. In the case of Sturmian words Cq, r — 1 and s„ = _D„_iG„.i = ul„G'„.i for all n > 1, 
where Gn,i — cib or 6a, according to n odd or even, respectively. □ 

Example 4.2. Recall that when all rf,; = 1, s is the Tribonacci sequence over {ai, 02, as} = {a, b, c}. For 
n = 4, we have s„ = S4 = abacabaabacab, D2 — aba, D3 = abacaba, and hence 

^4.1 = abacab and G4.2 = cabaabacab. 

□ 

A^ofe. Since L'n-r = Cik+i+n-r ^"-"^ < n < r, we also set 

Gn,r = ak+l+n-rSn tOT <n< r. (4.3) 

Proposition 4.8. For all n > 1, SnSn-iG^^^^ j, -^ — s„-is„G^\. 
Proof. It is easily checked that the result holds for 1 < n < fc — 1, since 

'^n^rL — lG^_-^j^_'^ — SfiDji — k — Sjia^^-y, 

and 



Now take n > k. Then, using (|4.2p . we have 

•^n'^n—lG^_^ A:— 1 ^n^n — k 



■ Sq — s„a„_|_j^. 



*n-H,*„_l *„_2 •^„_fc+i*„_fc ^n-k-l °l *0 ^ 



□ 



Remark 4.2. Recall Example lSTTl ForcQwitha= [0; l+di, ^2, da ...], it is well-known that, for all n > 2, 
SnSn-i{xy)~^ = Sn^iSniyx)~^ , whcrc x,y ^ {a,b}, x ^ y, and xy Sn-i- This is known as the Near- 
Commutative Property of the words s„ and s„_i. Because s„s„_i(a;?/)^^ — s„Z)„_2 and s„_is„(?/x)^^ = 
Sn-iDn-i, Proposition 14.81 is merely an extension of this property to standard episturmian words s. It 
is also worthwhile noting that Proposition 14. 81 shows that s„ is a prefix of s„_is„. 



Hereafter, we set d-j — for j > 0. 

Proposition 14.21 implies that |s„_|_i| — \Dn\ = \sn\ ~ l-Dn-fc+il, and hence |G.,i4-i^i| = |G„^fe_i|. In fact, 
we have the following: 

Proposition 4.9. For all n> 1, Gn,i — Gn-i,k-i- 
Proof. One can write 



Gn,l — £'„_iSji — ^n-l^n- 


' ' ' 


-fc+1 °n-k 














Whence, it follows from CoroUarv 14.71 that Gn,i = 




G„_i,fc_i since s„_i = Gn-i^fc-iAi-^ 



□ 
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Proposition 4.10. Let 1 < i < k and 1 < r < fc — 1. For all n > 0, 

(i) ai is the first letter of Gn^r ifn = i + r — 1 (mod fc); 

(ii) Oi is the last letter of Gn ^ if n = i — \ (mod fc). 

Proof, (i) The assertion is trivially true for Q <n < r since, by (|4.3p . we have Gn,r = o-k+i+n-rSn- Now 
take n > r. By definition, 

where Sn-r+i is a prefix of s„. Hence, one can write 

Gn,r — Gn — r+l,lSyi_j._^iSn — Gn—r,k—lSyj^_j._^iSn (4-4) 

by applying Proposition 14.91 

Now, one easily deduces from Proposition 14. II that a„ -<p s„_r if n = m + r — 1 (mod fc), and thus 

am Gn^r.k-l Sn-r if « = 771 + T — 1 (mod fc). 

(ii) For < n < r, Gn.r = a/c+i+n-rSn and, for each n > r, we have Gn,r Sn- Hence, a,„ ~<s Gn,r 
if 77 = m — 1 (mod fc), by Proposition 14. II □ 

4.2 Singular n-words of the r-th kind 

By definition of the words {sn)n>i-k (see p.2p ) and the fact that s = lim„^oo s„, one deduces that, for 
any ti > 0, s can be written as a concatenation of blocks of the form s„, s„_i, . . . , Sn-k+i, i-e., 

= — *n-fe+2 '^n-k+S ^ri-k+2 ) 

We shall call this unique decomposition the n -partition of s. This will be a useful tool in our subsequent 
analysis of powers of words occurring in s (Section [SI to follow). 

Note. Uniqueness of the factorization (|4.5p is proved inductively. The initial case 77, = is trivial. For 
77 > 1, the factorization of s„ in terms of the s„_i given by p.2p is unique because the Sn-i end with 
different letters (by Proposition 14. ip . So it is clear that every {n + l)-partition of s gives rise to an 
77,-partition, in which the positions of Sn-k+i blocks uniquely determine the positions of s„+i blocks in 
the original (77, + l)-partition (since Sn+i = Sn'^^sf/lj^ • • • s^lfe'^lsn-fe+i). Accordingly, uniqueness of the 
r7-partition implies uniqueness of the (n + l)-partition. 

Remark 4.3. Since each factor of s has exactly fc different return words (see Section r2.2.2p . two con- 
secutive Sn+i~i blocks (1 < 7 < fc) of the r7-partition are separated by a word V , of which there are fc 
different possibilities. From now on, it is advisable to keep this observation in mind. 

Lemma 4.11. Let 1 < r < fc — 1. For any n G N+, a factor u of length |s„| of s is a factor of at least 
one of the following words: 

• Cj(s„), < J < |s„| - 1; 

r+k+iSn if n < r. 

Note. The word sf"Sr^^^^ ■ ■ ■ sfjTfe'^+i Sn-fcS^'li • • ■ sf^Tr+iSn-r (1 < r < fc - 1) has length |s„|. 

Proof of Lemma \4.11\ In the 77--partition of s, one observes that two consecutive s„ blocks make the 
following fc different appearances: 

Sn^n and ^n^-a—\ ' ' ' — ^n — r^ni 1 ^ r ^ fc 1. 
(*) 
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Evidently, any factor of length |s„| of s is a factor of one of the above k different words. 

Now, factors of length |s„| of s„s„ are simply conjugates of s„. Furthermore, for n > r, the first 
\Sn"-i • ■ • s'^"-r+iSn-r\ factors of length \sn\ of (*) are again just conjugates of s„. The remaining factors 
of length |s„| of (*) are factors of 

Sji-r ■ ' ' ^n-k+l ' ' ' ^n-r+l Sn-rSn- 

For n < r, one can write (*) as s„s^'l]^ • • • Sg^a„_r+fe+iSn = s„s„a^:[.]^a„_r+fc+iSn, of which the first 
|s„| — 1 factors of length |s„| are conjugates of s„, and the other factors of length |s„| are factors of 

□ 



Lemma 4.12. For any n > 1, \^n-j\ — \sn \ — k. 

Proof. Induction on n and Proposition 14.21 □ 
Lemma 4.13. Let 1 < r < fc — 1. For any n > r, we have 

Sn-r ' ' ' ^n-k+l *n-fc*n-l ' ' ' Sri-r+l ^n-r — ^n-r'~^n,n 

and for 1 <n<r, a„+is„a~+ia„_r+fc+i = Gn,r- 

Proof. For 1 < n < r, one can write Gn,r — 'snttn-r+k+i = CLn+iSna^^iUn-r+k+i, by Remark l4.ll Now 
take n>r. Then, using Corollarv l4.7l and Proposition 14.41 

= Dn-rD-\s^D,,D-^^ 

— D n-is^n+i^d,. . . . o'^"-'-+2„ 

— n s''" • • • s'^"""+^s 

_ d„_r+l-l <i7i-fc + 2 d„ d„-r+2 

— ^n-r ' ' ' ^n-k+l *n-fc*ra-l ' ' ' ^n-r+1 ^n-r- 



□ 

Whence, it is now plain to see that each word Gn,rSn = Gn,rDn-rGn,r is a factor of s. We will now 
partition the set of factors of length |s„| of s into k disjoint classes. 

Theorem 4.14. Let 1 < r < fc — 1. For any n e N+, the set of factors of length |s„| of s can he 
partitioned into the following k disjoint classes: 

• := C(s„) = {Cj(s„) : < j < |s„| - 1}; 

• fl"^ := {w G Al : |w| = |s„| and w -< x^^Gn,rDn~rGn.rX^^}, where x is the last letter of Gn,r- 
That IS, f^|s„|(s) = U U • • • U 

Proof. First observe that Lemma 12.1^ 1). coupled with Proposition 13. 11 implies that each s„ is primitive, 
and hence jrij^j = |s„|. Also note that := {-5; : w G il"} = $1°; i-^-: is closed under reversal, which 
is deduced from Corollarv l4.6l 

We shall use Lemma [4. Ill to partition f7|s^|(s) into k disjoint classes; the first being 17^ = C(s„). Now 
consider the factors of length |s„| of the words 

S^r/+'"^ • • • 4"lc+l • • • Sn-rS« {n > r) . (4.6) 

Since (|4.6|) can be written as Dn-rGn.rDn~rGn,r (by Lemma [4. 131) . the first lUn-rl + 1 factors of length 
\sn\ — \L)7i-rGn.r\ are conjugates of s^i (and hence of s„) and the last factor is just s„. Hence, all other 
factors of length |s„| of (|4.6p are factors of x~^Gn.rDn-rGn,rX^^ , where x is the last letter of Gn,r- 
Moreover, Dn-r appears exactly once (and at a different position) in each word in 

:= {w G Al- : \w\ — |s„| and W -< X^^Gn,rDn-rGn.rX~^}; 
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whence jilJ'J — |G„,r| ^ 1- Since the letter just before Dn-r (equivalently, the last letter of Gn.r) in the 
word x~^Gn,rDn-rG7i,rX^^ is different for each r € [l,k — 1], it is evident that ri°, fi^, . . . , f^^"^ are 
pairwise disjoint. 

Now, for 1 < n < r, other than words in the sets 17", 17^, . . . , Q.^, the remaining factors of length 
\sn\ of s are factors of 

(4.7) 

(see Lemma |4. lip . The first factor of length |s„| of the word (14. 7p is s„ (i.e., the (|s„| — l)-st conjugate 
of s„) and the last is just s„. All other factors of length |s„| of (|4.7[) are factors of 

Defining fJJj := {w £ : |w| = |s„| and w ~< a^l.iGn,rDn-rGn.rO'^+i}, one can check that \^^\ = 
\Gn,r\ — 1 and fl^, fi^, . . . , ^^^^ are pairwise disjoint. 

It remains to show Uj=o = ^|s„|(s) for all n > 1. Indeed, |0|s^|(s)| = {k — l)|s„| + 1 (from the 
complexity function for fc-strict standard episturmian words), and we have 

fe-1 fe-1 fe-1 

J2 m = \Sn\ + - 1) = \Sn\ + J^iK] " \D^-j\ " 1) 

fe-1 

= k\Sn\- k+l-J2\Dn-j\ 

= fc|s„| - + 1 - (|s„| - fc) (by Lemma mni) 
= (fc-l)|s„| + l. 

□ 

Let us remark that the sets ft^ are closed under reversal since x~^Gn.rDn-rGn,rX^^ is a palindrome; 
that is H.^ :— {w : we fl^^} — fl^^. We shall call the factors of s in fl^^ the singular n-words of the r-th 
kind. Such words will play a key role in our study of powers of words occurring in s. 

Evidently, for Sturmian words Cq, = and we have Q,\s„\{ca) = C(s„) U {"!«„}, as before. 

5 Index 

A word of the form w = (uv)"'u is written as w — z^, where z = uv and r := n + The rational 

number r is called the exponent of z, and w is said to be a fractional power. 

Now suppose X is an infinite word. For any w -< x, the index of w in x is given by the number 

ind(w) = sup{r G Q : ^ x}, 

if such a number exists; otherwise, w is said to have infinite index in x. Furthermore, the greatest number 
r such that w' is a prefix of x is called the prefix index of w in x. Obviously, the prefix index is zero if 
the first letter of w differs from that of x, and it is infinite if and only if x is purely periodic. 
For all n > 0, define the words 

in := Ai-fe+iG„+i,fe_i and Vn := Sn-iDn-i = s'^"_is'^"S2 ■ ■ ■ sf^ Sq\ 

Note. By convention, tq = ai^a^^ = e, and t„ = a,"^^2^n+3'5n+i for < n < fc — 2. 
The next two results extend those of Berstel [2] . 

Lemma 5.1. For all n>l, the word r„+i is the greatest fractional power of Sn that is a prefix of s, and 
the prefix index of Sn in s is I + dn+i + |-Dji-fe|/|s,i|. 
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Proof. First we take n > k. Observe that the longest common prefix shared by the words s„ and t„ is 
since 

Sn = Dn-k+lGn,k-l and tn = Dn^k+lGn+l,k-l = i^n-fc+lG„+2,l (5-1) 

where Gn,k-i and Gn+i^fc-i do not share a common first letter, by Proposition l4.ldl Clearly, Sn+iSn -<p s, 
and we have 

„ — „d„ + i d„ , , , d„-k+3 

= si^+^+\stT'-'s„^k-i ■ ■ ■ s?40G„,fe-i (by m) 

= s^+^+ii„_i. (5.2) 
Hence, Sn""*'^'^^ is a prefix of s. Also observe that the longest common prefix of t„_i and s„ is D„~k since 

tn-l — Dn-kGn,k-l and S„ = Dn-kGn.k 

where Gn,k-i and G^.^ have different first letters, by Proposition 14. 101 Further, from (|5.2p and Proposi- 
tion we have 

Sn+lSn = sj^" + ^"'""^i„-l = sf" + ^"*""^_D„_feG,i,A;-l = SnDnGn,k-l = fn+lG„,fc_l. 

Thus, the greatest fractional power of s„ that is a prefix of s is r„+i with 

|r„+i| = |s„I?„| - |<"+i+iZ?„-fc| = (d„+i + l)|s„| + \Dn-k\; 

whence the prefix index of in s is 1 + + \Dn-k\/\sn\- 
Similarly, for 1 < n < fc — 1, we have 

dn + l dn di 

^n+l^n — ^n — 1 ' ' ' ^0 ^n+2Sn 

= <"+i+iA»-fcG„,fc_i 

= Tn+lGnM-l- 

Therefore, the greatest fractional power of s„ {— Dn-kGn-i,k-i) that is a prefix of s„+is„ -<p s is r„_|_i, 
where |r„+i| = (dn+i + l)|sn| + |-D„-fc| = {dn+i + ~ 1- That is, the prefix index of s„ in s is 

l+dn+i - l/|s„| for 1 < n < fc - 1. □ 

Lemma 5.2. For all n > I, the index of Sn as a factor of s is at least 2 + dn+i + \Dn^k\/\sn\, and hence 
s contains cubes. 

We will show later that the index of Sn is exactly 2 + dn+i + \Dn-k\/\sn\. 

Proof. Setting e — 1 + dn+i + \Dn-k\/\sn\, we will show that Sn+k+2 contains a power of s„ of exponent 
1 + e. Certainly, using Proposition 14. 8[ one can write 

Sn+fe+2 = Sn'+k+l Sn+fc+lSn+A;£'n+fc£'„+2 

~ ri+fe+l ■^n+k^n+k+l'~'n+k+l,l^''i+k,k~lJ-^n+k-^n+2 
= ^j^k^i Sn+kDn+kGn+k,k-lDn+kD n+2- 

The suffix Sn+kDn+kGn+k,k-iDn+kDn+2 coutains the exponent 1 + e of s„. More precisely, Sn+k ends 
with Sn, and Dn+kGn+k,k-i shares a prefix of length |D„+fe| with Sn+k+i- Thus, since r„+i is a prefix 
of s of length 

\rn+l\ = \Sn\ + \Dn\ < |Ai+/c|, 

we have s„r„+i -< Sn+kDn+k -< Sn+fc+2. □ 
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6 Powers occurring in s 



For each m, Z G N with I > 2, let us define the foUowing set of words: 

V{m;l) := {w £ Al : \w\ = m, ^ s}, 

where s is the fc-strict standard episturmian word over Ak = {oi, 02, • ■ • , Ofc} with directive word A given 
by dsn). Also, let p{m\l) := \V{m\l)\. 

The next theorem is a generalization of Theorem 1 in 6 . It gives all the lengths m such that there 
is a non-trivial power of a word of length m in s. Firstly, let us define the following k sets of lengths for 
fixed n e N+ : 

Vi{n) := {r\sn\ ■ l<r<dn+i}, 

{144-1 •••4+2-7 Sn+i-.l : l<^<rfn+i}, 2<i<k-l, 
Vkin):={\s:,sti---stlT^Sn+i-k\ : 1 < r < - 1}. 

Theorem 6.1. Let m, n £ he such that |s„| < m < |sn+i| o^nd suppose m ^ Ui=i^i(^)- Then 
p{m;l) = for all I > 2. 

Remark 6.1. Put simply, the above theorem states that if a word w has a non-trivial integer power in 
s, then \w\ G lJi=i ^i(^) for some n. For instance, if fc = 3, we have 

3 

y V,{n) = |<s„_i| : 1 < r < d„+i} U {|<4liS„_2| : 1 < r < rf„+i - 1}. 

In the particular case of the Tribonacci sequence. Theorem 16.11 implies that if is a factor, then |w| G 
{|sri|, |sri| + |sti-i|} for somc n, where the lengths (|si|)i>o are the Tribonacci numbers: Tq = 1, Ti = 2, 
T2 = 4, = r,_i + T,_2 + T,-3, « > 3. 

The proof of Theorem 16.11 requires several lemmas. Let us first observe that in the n-partition of s 
(see (|4.5p ') to the left of each s„ block, there is an Sn+i-j block for some j G [l,k]. Also note that each 
Sn+i-j is a prefix of s„. Furthermore, to the left of each Sn+i-i block is another s„_|_i_i block or an 
Sn+2-i block, for each i G [2,k]. 

Lemma 6.2. Let n G N+. Consider a word w ^ s of the form w = uSnV for some words u, v G A\, 
e. 

(i) Lf w — U1U2, where Ui ~<s s„+i_i for some i G [1, k] and U2 ~<p Sn, then Ui = u. 

(ii) Lf w ~ uiSn+i-iU2 for some i G [2,fc], where ui -<s s„_|_2-i and U2 -<p s„, then ui — u or 

UlSn+l-i = u. 

(iii) Lf w — uiSn+i-iU2 for some i G [2, fc — 1], where ui Sn+i-i and U2 -<p s„, then ui — u or 

UlSn+l-i = u. 

Proof, (i) Other than the case when ui = u, U2 = s„ and v = e, the only other possibility is: 



W = 



u 




V 


Ui 


U2 



(Note that ui -<s s„+i_i for some i G [l,fc], and therefore < |s„+i_i| < |s„|.) 

In this case, using the figure, we write ui = uu', Sn = u'v', U2 = v'v for some u', v' (u' ^ e). As v' 
is a prefix of Sn, we have s„ = v'v" for some v" , thus u' and v" are conjugate. So there exist e, / and 
non-negative integers p, q such that v' — {ef)^e, u' — (e/)*^, and v" = (/e)' with ef primitive. Hence 
Sn = (e/)^^''e. As u' is a suffix of s„-|_i_i which is a prefix of s„, we must have, by primitivity of ef, 



13 



S'n+i-i — {<^fYi a-iid then r = 1. But u' is non-empty, so u' = ef = Sn+i-i, and it follows that u ~ e; a 
contradiction. 

(ii) Let i G [2, k] be a fixed integer. Since ui and M2 Sn, we have < |sn+2-i| < \sn\ 

and \u2\ < \sn\- Accordingly, there exist only three possibilities (other than wi = u or uiSn+i-i = u), 
and these are: 



w = 


u 




V 




Ui 




U2 



or 



w = 


u 




V 




Ui 




U2 



or 



w = 


u 




V 




Ui 


■^n+l—i 


U2 



In the first instance, us„w — uiSn+i-iU2 — uu' Sn+i-iv'v, where U2 — v'v with v' -<s s„ and ui = uu' 
with u' Sn- That is, s„ = u' Sn+i-iv' , where u' s„ and v' -<s s„, and ui = uu' Sn+2-i- 
Therefore, u' Sn+2-i, and hence the word Sn+i-i must be preceded by the last letter of Sn+2-i- 
However, since u' is also a prefix of s„ = sj^-i ' ' ' ■5„+Y-fc Sn-fcj where s„_i, . . . , s„+i_fe, s„_fc do not 
share a common last letter (by Proposition 14. ip . one is forced to presume that u' = s'^"_-^s'l"i2 ■ ■ ■ 
(resp. u' = e) when i e [3, k] (resp. i = 2). This contradicts the fact that 1 < \u'\ < |s„+2-i|- 

In the second instance, we have uSnV — uiSn+i-iU2 = uu' Sn+i-iU2, where ui = uu' with u' -<p s„ 
and U2 ^p Sn- Consider the word w' := WU2~^ = uSnVU2~^ = uSnv' , i.e., w' = uSnv' = uiSn+i-i, 
where v' -<p v and v' Sn+i-i- Since ui Sn+2-i and Sn+i-i -<p s„, it follows from assertion (i) that 
ui ~ u and hence Sn+i-i = Snv' , which is absurd unless i = 1 and v' = e. But i > 1, so this situation is 
impossible. 

Lastly, uSnV = uiSn+i-iU2 = UlSn+i-iv'v, where U2 = v'v -<p Sn with v' -<s s„. Consider the word 
w' := Ui^w = Ui^uSnV = u'snV, i.e., w' — u'snV — Sn+i-iU2, whcre u' -<s u and u' ~<p Sn+i-i- Since 
U2 ~<p Sm one obtains, as an immediate consequence of claim (i), u' — Sn+i-i, U2 — Sn, v — e, and hence 
u — ui-Sn+i-i', a contradiction since |uis„+i_i| > 

One can prove assertion (iii) in a similar manner. □ 

Lemma 6.3. Let c £ Ak and n £ N be fixed. Consider an occurrence of cs„ in s. Then the letter c is 
the last letter of a block Sn+i—i of the n-partition of s, for some i G [1, k], and the integer i {equiv. the 
block Sn+i-i) is uniquely determined by c. In particular, in every occurrence of Sn+i-iSn in s, the word 
Sn+i-i is a block in the n-partition of s. 

That is, occurrences of words w containing cs„ (c e Ak) must be aligned to the n-partition of s. 

Proof. This assertion follows from Lemma 16.21 The case n = is trivial, and for rt > 1, observe from 
Lemma 16.21 that the given s„ is either an s„ block in the n-partition of s or has an Sn+i-j block of the 
n-partition as a prefix, for some j £ [2,/c]. In the first case, to the left of s„ there is an Sn+i-i block, 
for some I £ [1, k]. Whereas, in the second case, there is an Sn+i-j or Sn+2-j block (of the n-partition) 
to the left of s„. That is, Sn is preceded by an Sn+i-i block of the n-partition for some i £ [1, k]. Since 
the last letters of s„, s„_i, . . . , Sn+i~k are mutually distinct (by Proposition [ITT]), it is clear that i (and 
hence Sn+i^i) is uniquely determined by the letter c. □ 

We can now determine the exact index of s„ in s. 

Lemma 6.4. For any n > 1, the word of maximal length that is a factor of both s and the infinite 
sequence (s„)'^ := s„s„s„ ■ ■ ■ is st"^^'^'^ Dn-k, «-e., ind(s„) = 2 + dn+i + \Dn^k\/\sn\- 



14 



Proof. According to Lemma I6.3[ any occurrence of sj^ (p > 2) must be aligned to the n-partition of s. 
By inspection of the n-partition of s (see (|4.5p ). it is not hard to see that between two successive Sn+i-k 
blocks there is a word possessing one of the following k forms: 



„q „<ir. + l „d.„ ^ ^ ^ d„+3-k 



or 

°n °n+2-i°ri+l-it'n °n-l '^n+2-fc ' « t At IJ. 

Thus, the alignment property implies that an occurrence of s^^ {p > 2) is either a prefix of 

^^^^+''4" 1 • • • (6-1) 

for some integer r < 1 and suitable zi, or a prefix of 

for some i € [2, k — 1], r < dn+i and suitable Z2. 

Now, suppose is a prefix of the word (|6.ip . Since sf"_i ■ ■ ■ s^^+^^-Tfe s„+i_fcS„ is not a prefix of s„s„ 
(in fact, it is the word Sn(sJ^+Y-fc ^Sn-fe)^^Sri = s„tn-i), sf^ must also be a prefix of 

As in the proof of Lemma 15. 1[ one can show that the prefix index of s„ in the word (|6.3p is dn+i + 
r + 1 + \Dn-k\/\sn\, which is at most dn+i + 2 + |Z?„_fc|/|s„|. Furthermore, in the word (|6.2|) . it is 
clear that the prefix index of s„ is less than for (j6.ip (since s^lj^ • • ■ s^'^^^Jj's„+i_iS„ has length less 
than the word s'^"_^ ■ ■ ■ s'^^^^-k^n+i-kSn and is not a prefix of s„s„). Whence, it has been shown that 
ind(s„) < dn+i + 2 + |Z?„_fc|/|s„|, and so the result is now an easy consequence of Lemma 15.21 (which 
gives ind(s„) > d„+i + 2 + | Ai-fc|/|s„|). □ 



The following analogue of Lemma 3.5 in [5] is required in order to prove Theorem 16. II 

Lemma 6.5. Let n G and suppose u ~< s with |s„| < |m| < |sn+i|- Then the following assertions 
hold. 

(1) For all i G [1, k], if u starts at position I in some Sn+i-i block in the n-partition of s and also starts 
at position m in some factor Sn+i-i ofs, then I = m. 

(2) For all i G [l,k — 1], if u can start at position I in s„4.i_j and at position m in s„_i, then I = m. 

Proof. By inspection of the n-partition of s, notice that, for 1 < z < A; — 1, an Sn+i-i block is followed by 
either an Sn+i-i block, an s„ block, or an s„_i block. Furthermore, an Sn+i-k block is always followed 
by an s„ block. 

Let Un+i^i be the prefix of u of length |s„+i_i|. 

(1) Let 1 < i < k and consider an occurrence of u that starts in an s„+i_i block of the n-partition 
of s. If this s„+i_i block is followed by an Sn+i~i block, then Un+i-i is a conjugate of s„+i_i as 
Un+i-i ^ s„+i_.js„+i_.j and |u„+i-i| = |s„+i_j|. Similarly, if this Sn+i-i block is followed by an Sn 
block, then is a conjugate of s„+i_i since Sn+i-i s„. And, if this Sn+i-i block is followed by 
an Sn-i block, then again u„+i-i is a conjugate of s„+i_j. Indeed, in the (n + A; — z)-partition of s, Sn+i-i 
is always followed by an Sn+fc-i block, which has Sn+i~i as a prefix; whence Un+i^i -< Sn+i-jSn+i~j- So, 
in any case, Un+i-i is a conjugate of Sn+i-i, and the result follows from the fact that the conjugates of 
Sn+i-i are distinct. 

(2) Let 1 < i < k — 1. Suppose the word u has occurrences starting in s„+i_i blocks as well as Sn-i 
blocks in the n-partition of s. (Note that this implies n > i.) First consider an occurrence of u beginning 
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in a block of the form Sn-i of the n-partition. As an s„_i block is always followed by an Sn^^-i-i block 
in the {n + k — i — l)-partition of s and Sn+i-i -<p Sn+k-i-i, we have 



Thus, in light of Lemma [6731 we have the following fact: 

cs„_j 7^ where c £ A- and c s„+i_i_fe. (6.4) 

Consider an occurrence of u starting in an Sn+i-i block of the n-partition, which can be factorized as 

Sn+l-i — S„_i Sn-i-l ■ • ■ S^^2-i-k^n+l-i-k- VD.Oj 

We distinguish two cases, below. 

Case 1: The word u begins in the left-most s„_i block in (j6.5p when dn-i+i > 2. In this case, — 2 
is a conjugate of s,i-i and hence, as deduced in (1), the starting position of u in this s„_i block must 
coincide with its starting position in any occurrence of Sn-i in the n-partition of s. 

Case 2: The word u does not start in the left-most Sn-i block in (|6.5p . The block to the right of 
Sn+i-i in the n-partition is either another Sn+i-i, or an Sn-i, or an s„. In any case, Sn-i is a prefix of 
this block to the right of s„+i_i, which implies u„+i_i contains an occurrence of Sn+i-i-kSn-i- This 
contradicts (16.41). □ 



Proof of Theorem \6.1\ Clearly, pirn^li) > p{m;l2) if h < h- Thus, it suffices to show that for m ^ 
Ui=i we have 

p(to;2) = 0, (6.6) 

i.e., there are no squares of words of length m in s. 
Suppose (|6.6p does not hold for some m satisfying 

fc 

m^[jV,{n), (6.7) 



and let m be a word of length m with \sn\ < m < \sn+i \ such that u -< s. For convenience, we shall write 
y(i)y(2) allow us to refer to the two separate occurrences of u. Let 1 < i, j < k. Obviously, u'^^'' 



u 



2 



starts at position q, say, in some s„+i_i block of the n-partition of s. Further, by Lemma 16.51 m*^^^ also 
starts in some Sn+i-j block of the n-partition of s at position q. From the proof of Lemma 16.41 recall 
that two consecutive Sn+i-k blocks in the n-partition of s are separated by a word of one of the following 
k forms: 



st^-^, re {0,1}, 



or 

If we also keep in mind that |s„| < |u| < |s„+i|, then using Lemma 16.51 we see that the possible lengths 
|m| of u are: 

|s„| and \s„s^"_-j^ ■ ■ ■ s^__^J^l Sn-i\ 

where 1 < i < fc — 1 and 1 < r < d„+i (with r =/= (i„_|_i if i = fc — 1 as |w| < |s„_|_i|). Therefore, m does 
not satisfy (|6.7p : a contradiction. □ 

The next five propositions, which have some interest in themselves, are needed in the next two sections 
where we shall prove our main results concerning squares, cubes, and higher powers in s. 

Notation. Given I e N and w G A^, denote by Pref/(w) the prefix of w of length I if \w\ > I, w otherwise. 
Likewise, denote by Suff;(z/;) the sufhx of w of length I if \w\ > I, w otherwise. 

Recall that fij^ denotes the set of singular n- words of the r-th kind (l<r<fc— 1), as defined in 
Theorem 14. 141 
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Proposition 6.6. Let n G N"*". Suppose w £ for some i G — 1] and let v = Pref;(w) where 

I < I < |G„+i_i^i| — 1. Then the word vsn+i~i occurs at position p in s if and only if the n-partition 
of s contains an s„ starting at position p + I and an Sn-i ending at position p + I — \ . In particular, w 
occurs at exactly those positions where us„+i_i occurs in s. 

Proof. Let z G [1, A; - 1] be fixed and let 1 < / < |G„+i_j,i| - 1. 

First note that |u;| — |s„+i_i| and w -< x^^Gn+i-i.iDn-iGn+i-i.ix^^ where x G Ak, by definition 
of Since \Dn_iGn+i-i.i \ — \sn+i-i\, the word v — FTefi{w) is a suffix of x^^Gn+i-i,i which, in 

turn, is a suffix of Sn-i as Gn+i~ui = Gn-t,k-i- 

Now, by Lemma 16.51 the word Sn+i-i can only occur at the starting positions of blocks (in the n- 
partition) of the form s„, s„_i, . . . , Sn+i-fe, all of which have different last letters (by Proposition 14. 1 p . 
In particular, each s„_j block (0 < j < fc — 1, j / i) of the n-partition of s has a different last letter to 
Sn-i (and hence v). One should note, however, that an Sn-i block of the n-partition is never followed by 
an Sn+i-i block (except if i = 1, in which case we do have certain s„ blocks preceded by s„_i blocks). 
Also observe that if z = Suff/(s„_|-i_i), then an Sn~i block of the n-partition is only ever followed by 
Sn+i-iZ^^ — Dn-iGn+i-i.iz^^ if it is foUowed by an s„ block of the n-partition. Taking all of this into 
account, one deduces that the word vSn+i-i occurs only at positions in s where an Sn-i block of the 
n-partition is followed by an s„ block, which has Sn+i-j as a prefix. This completes the proof of the first 
assertion. 

As for the second assertion, recall that w begins with the word v which is a non-empty suffix of 
x~^Gn+i-i,i = x~'^Gn-i,k-i -<s Sn-i- Consequently, w occurs at every (|sri_i| — I + l)-position of an 
Sn-i block that is followed by an s„ block in the n-partition of s, i.e., w occurs where the prefix vSn+i-i 
of vSn occurs in s. By Lemma 16.51 the only other position where w may occur (besides where vsn+i-i 
occurs) is in the (|s„_i| — I + l)-position of an s„ block that is preceded by an Sn-i block. Now, to the 
right of this type of s„ block (in the n-partition) there appears another s„ block or an s„_i block. The 
fact that Sn+i-iSn-i s„s„_i SnSn implies that w ends with the prefix of Sn-i of length \sn-i \ — l- 
More precisely, w ends with the word 

Dn+l-i-kZl 

where zi is a non-empty prefix of G„+i_i^i oflength \zi\ — |G„+i_i^i|— L On the other hand, by definition 
of w, we have that w ends with 

Dn-iZ2 

where Z2 is a non-empty prefix of Gn+i-i.i of length \z2\ = |G„+i_i.i| — /. It is impossible for both 
situations to occur, so we conclude that w occurs at exactly those positions where vSn+i-i occurs. □ 

Notation. For n > 1, denote by P„ the set of all formal positions of s^"^^ ^ Sn-i in the (n — l)-partition 
of s. 

Proposition 6.7. For any n G N+, the set of all positions of Dn in s is P„. 

Proof. Proof is by induction on n. For n = 1, Dn = Di = s^^~^Sq^, and hence Di occurs at exactly 
those places in s where s'[^~^sq = {a'[^a2)'^'^~'^ai occurs in the 0-partition of s. We claim that there is a 
one-to-one correspondence from the set of all positions of Dn in the (n — l)-partition of s to the set of all 
positions of Dn+i in the n-partition of s (see (|4.5p ). Assume that P„ gives all of the occurrences of Dn 
in the (n — l)-partition of s. Since Dn+i = s^^+i ^SnDn — -DnS„(s„+i)''"+^^^, Dn+i occurs at any place 
in P„_|-i. Conversely, since each occurrence of in (|4.5p naturally gives rise to an occurrence of Dn 

in the (n — l)-partition of s, the word D„+i must occur in s at exactly those places given by P„+i. □ 

Consider two distinct occurrences of a factor w in s, say 

s = uwv ~ u'wv', \u'\ > \u\, 

where v, v' G A'^. These two occurrences of w in s are said to be positively separated (or disjoint) if 
> \uw\, in which case u' — uwz for some z G A'^^ and hence s = uwzwv'. 
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Proposition 6.8. For any n G N"*", successive occurrences of a singular word w S Uj=i ® '^''^ 

positively separated. 

Proof. Let 1 < r < k — 1. For 1 < n < r, observe that 

fln^iweAl : = |s„| and w -< s„_i£>„_iL',;;;^^s„_ii:'„_i}, 

where Dn-r = fln-r+fe+i for 1 < n < r. It is left to the reader to verify that consecutive occurrences of 
a word w £ fl^ ^ n < r) are positively separated in s. 

Now take n > r + 1 and suppose w G fij^. Then Dn-r will occur in w. By Propositions 16 . 7l the word 
Dn-r occurs at exactly those places where s^l^"^^^ ^s„_r-i occurs in the (n — r — l)-partition of s. First 
note that the letter just before Dn-r in ^ is the last letter of Gn^r, which is the first letter Gn,r, and 
hence the last letter of s„-r-fe (by Propositions 14.11 and I4.l0|) . On the other hand, in the word w, the 
letter just after Dn-r is the first letter of Gn^r- Since there are k different return words of s'^^Sr^^ s^-r-i 
in s, there exist k different possibilities for occurrences of s^1~'"''^s„_r-i in the [n — r — l)-partition of s; 
namely: 

(1) 

/ Ci„_r+1 — 1 N dn-r — 1 ^71-1 — 1 ^n-i fc + 2 / ^n-r+l — 1 \ 

\^n-r — 1 J^ri-r-l ^''n-i — 2 ' ' ' '*ri-r-fe+l v'^n-r Sn-r-l) 

— D D^^ D D^^ 

— J-^n-rJ-^n-r-k^n-rJ-^n-r-l 

— D R \dr,-r+l n-l 

— -'-^n—r\^n—r J n— r — 1 

— n CS Adn-r+l-lr* 

— -'-^n—r\^n—r J ^n—r,l: 



(2) 



\^n-r Sn-r-l)Sn-, — 1 ^n-r-2 ' ' ' ^n-t — i+1 •^n-r-U'Sri-?- 



Dn-rD^_^ — l{Sn-r ^ Sn-r-l) 



Dn — rGn — rJ^rL—r ^n—r — l if ^n — r+1 ^ 2, 

Dn-rG„-r-l,l-l ifc?n-r+l = l, 



where 2 < / < fc — 1; 

(3) 




r+1 

if dn-r+1 — 1- 

Thus, if c^n— r+1 ^ 2, the word D^—r is followed by either Sn—rj Gn~r,h or Sn— r— ij of which only Sn—r 
has the same first letter as Gn,r- Similarly, if dn~r+i = 1, the word Dn-r is followed by either Gn-r,i, 
Gn-r-i,i-i, or Gn-r-i,k-2, of which Only Gn-r.i has the same first letter as Gn,r- Therefore, only in case 
(1) will we have Dn-r followed by the first letter of Gn,r- Accordingly, one deduces that any occurrence 
of w in s corresponds to a formal occurrence of the word 

Sn-r-k[Sn-r ■^n-r-1 ) ^n-r-l ^n-r-2 ' ' ' ^ n-r-k+l^ri-i — kK^n-r Sn-r-l) 

in the (n — r — l)-partition of s. Hence, we conclude that occurrences of w are positively separated in s 
since a word of the above form is positively separated in the [n — r — l)-partition. □ 



The next proposition follows from Lemma 16.41 and Propositions 16.61 and [6] 

Proposition 6.9. Let n G and suppose u ^ s with \u\ — \sn\- Then u'^ ^ s if and only if u E C(s„). 
In particular, if u is a singular word of any kind of s, then 7^ s. Moreover, for any n>k — l, ifu^^s 
with \sn\ < \u\ < |s„+i|, then u does not contain a singular word from the set ^^^j+2-/c- 
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Proof. As s„"^^ Dn-k -< s (see Lemma [6.4p . the square of any conjugate of s„ is a factor of s. (Note 

.-1 

■'n+l 



that Sn'''''^'^'^ Dn-k = Sn"^^^'^a for 1 < n < fc — 1.) Now recaU that the set of all factors of s of 



length \sn\ is the disjoint union of the sets C(s„) and Uj=i ^n- Consequently, the first two assertions are 
deduced from Proposition 16. 81 

For the last statement, let n > fc — 1 and suppose — u'^^u'^' is an occurrence of in s, where 
\sn\ < 1^1 < |s„+i|. Also assume w e ^n^2-k w -< u. Clearly, w occurs in both m^^^ and u^^^ at the 
same position. By Proposition 16.61 (with i = fc — 1), different occurrences of w correspond to different 
occurrences of s„+i-k blocks in the n-partition of s (as an s„+i-k block is always followed by an s„ 
block). Between two consecutive s„+i_fc blocks in the n-partition, there is a word taking one of the 
following fc forms: 

or 

Therefore, the distance between consecutive occurrences of Sn+i-k blocks in the n-partition of s is 

|s„+i| (r = 0), |s„+i| + |s„| (r = l), or |s„+i| - (|i:)„+i_j| - lAi+i-fel) + |s„+i|, 
and all of these distances are at least |sn+i|, which implies \u\ > |s„+i|; a contradiction. □ 
More generally, we have the following proposition. 

Proposition 6.10. Let n e and suppose ^ s with |s„| < |u| < |s„+i|. Then u does not contain a 
singular word from the set f2^_|_]^_,j for any i (£ [1, fc — 1]. 

Proof. The case when i = fc — 1 is proved in Proposition l6.91 so take i £ [1, fc — 2] . Let — w'^^u'^^ be an 
occurrence of in s, where |s„| < |u| < |s„+i|. Assume w G for some i G [1, fc — 2], and w ^ u. 

Clearly, w occurs in both u^^-* and u'^' at the same position. By Proposition l6.6l different occurrences of 
w correspond to different occurrences of Sn~i blocks that are followed by s„ blocks in the n-partition of 
s. By inspection of the n-partition (see (14.51) ). the word of minimal length that separates two such Sn-i 
blocks is 

That is, the minimal distance between two consecutive occurrences of an Sn-i block (with each appearance 
followed by an s„ block) is |s„+i| + |s„| + \Dn\ — \Dn-i\ > \sn+i\ + \sn\, which implies |u| > |s„+i| + |s„|; 
a contradiction. □ 



6.1 Squares 

The next two main theorems concern squares of factors of s of length m < di + I = \si\ and length 
rn > |si|, respectively. 

A letter a in a finite or infinite word w is said to be separating for w if any factor of length 2 of w 
contains the letter a. For example, a is separating for the infinite word {aaba)^ . If a is separating for 
an infinite word x, then it is clearly separating for any factor of x. According to [H Lemma 4] , since the 
standard episturmian word s begins with oi, the letter ai is separating for s and its factors. 

Theorem 6.11. For 1 <r <di, we have 

, Jl */r<(di + l)/2, 

p(r;2) = < 

[O z/r> (di + l)/2. 

In particular, V{r;2) = {{a\f} for r < [di + and V{r;2) ^ % for r > [di + l)/2. 
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Proof. Consider a factor m of s with \u\ = r < di. As ai is separating for s and ai occurs in runs of 
length di or di + 1 (inspect the 0-partition of s), we have that u is either or a conjugate of a^^^aj 
for some j, 1 < j < k. Further, it is evident that there are no squares of words conjugate to ai^^aj, 
1 < j < k. And, using the same reasoning for words u of the form a^^, one determines that s contains the 
square of u if and only if 2r < di + 1, in which case there exists exactly one square of each such factor 
of length r of s; namely (a^)^. □ 

Let w be a factor of s with \w\ G UiLi^j(") for some n. Roughly speaking, the next theorem 
shows that if is a factor of s, then w is a conjugate of a finite product of blocks from the set 
{s„, Sn+i, . . . , Sn+i-k}, depending on \w\ and dn+i- For example, if \w\ = r|s„| for some r with 1 < r < 
1 + d„+i/2, then ^ s if and only if w is one of the first |s„| conjugates of s^. 

Theorem 6.12. Let n, r 

(i) Forl<r<dn+i, 



\sn\ i/l < r < l + d„+i/2, 

P(kJil;2) = { |Ai-/c| + 1 if dn+i is even and r =1 +dn+i/2, 
if l + dn+i/2 <r <dn+i. 



(6.8) 



In particular, 

^(l<l;2) 



{CM) 



0<i<|s„|-l} z/l < r < l + d„+i/2, 
< J < |-Dn_fe|} if dn+1 is even and r = 1 

/2 < r < dn+i- 



hi+l 



/2, 



(6.9) 



(ii) For 1 < r < d,i+i and i £ [2, k] (with r ^ dn+i if i = k), 



have 



P(KSn-l • ■ ■ 4+^2-/Sn+l-z|; 2) = | 



1. 



(6.10) 



In particular, 



:r2~75n+l-.|; 2) = • ■ ■ <r2-75n+l-») : < J < |A.+ 1-»|}. (6.11) 



Remark 6.2. For standard Sturmian words Cq, we have s„ — Dn-ixy, where x, y £ {a, 6} {x ^ y), and 
hence lUn-il = (7„ — 2 for all n > 1. Accordingly, Theorem 16 . 1 21 agrees with Theorem 3 in 6J for the case 
of a 2-letter alphabet. 

The proof of Theorem 16.121 requires the following three lemmas. 



Lemma 6.13. Letn E N+ and setui :— sj^s^'lj^ 
Then, for all i G [2, k], we have 



■ ■ s„+"2^_i Sn+i-i for each i G [2, fc] and 1 < r < dn+i — 1. 
-< s, (6.12) 



and 



2 T~) , / dji^i dn + 3 — i \2 

^i-Un+l-i \S„ S„_i • • • S„+2-i • 



(6.13) 



Proof Let us first note that, for i = fc, (sfr^^s^'^i • • ■ s'^"^^2-k ^n+i-k)'^ Dn+i-k = s^+i is a factor 



dn + 3-k ^ \2 n — £.2 

+2-fc*«+l-fcJ I>n+l-k - S„- 

of s (by Lemma [5. ip . Now, for i G [2, fc — 1], by inspection of the n-partition of s, the word 



°n+2-k *Ti+l-fc*„ 



(4" 



n2 „d7i4 



-1 d„ 



; -L.'n+l-2^„+l_fcA„ 



S" 
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is a factor of s (where Dn+i~k is a prefix of s„ for n > fc — 1, and D~^^_^. = an+2 for 1 < n < fc — 2). 
Thus, assertion (|6.12p is proved. 

As for the second assertion (|6.13|) . one can write 

_ d„ + i-r r d„ + i-r <i„ . . . „'^" + 3-i „ . 

„ d„ + i-r 2r, n-1 <i„+i-r~l £i„ . . . „d,i+3-i . 

which yields the resuh since Dn-k is a prefix of s„ and s„_i for n > k, and D~^f. = a„+i for 1 < n < 
fc - 1. □ 

Lemma 6.14. Let n £ and let = u^^^u'^'^^ be an occurrence of v? in s, where |s„| < |m| < |s„^i|. 

(i) For all n > 1, if \u\ — |sj^| with 1 < r < dn+i, then u^^^ begins in an Sn block of the n-partition of 
s. Moreover, is a factor of st"^^^"^ SnV^^ = s^T^^^'^ Dn-k, where \v\ = |s„| — |I?„_fe|. 

(ii) Let id [2, fc — 1]. For all n > i — 1, if \u\ = Is^s^^Li ■ ■ ■ s'^2-i ^n+i-i\ with 1 < r < dn+i, then u^^^ 
starts in an Sn block and contains an Sn+i-j block that is followed by an s„ block in the n-partition 
of s. Moreover, v? is a factor of (sj^s^^j^ • • • sf'^''^^jr/s„+i_i)^I?„+i_i, which is a factor of 

(iii) For all n > k — 1, if \u\ — \Sj^s'^"_i- ■ ■ s'^2-k^n+i-k\ with 1 < r < c?„+i — 1, then u^^^ starts 
in an s„ block and contains an s^+i-k block of the n-partition of s. Moreover, v?' is a factor of 
i-'^ns't-i • ■ • s^+"^2'~fc ■5n+i-fc)^Ai+i-fc. which is a factor of 

2 _ / <i„ + i d„ ^ ^ ^ d„ + 3-k n2 
*Ti+l — l*r! *n+2-A: • 

Proof, (i) By similar arguments to those used in the proof of Theorem 16.11 the first claim is obtained 
from the fact that — r\sn\ with 1 < r < (i„+i, together with Lemma 16.51 For the second claim, one 
uses the fact that an s„ block in which u'^^ starts is followed by the word s^s'^_i ■ ■ ■ s^^^t^^^ Sn+i-iS„, for 
some i £ [2, fc] and < p < dn+i- Hence, we have 



't!-1 ' ■ ' ''n+2-i *«+l-«''n'-^n,i-l ^ *"l*n+l-i ^n-i ' ' ' °n+l-k ^n-k ) 



= s„_is„G„\ (bv Proposition [48 



where \v\ = |s„| — \Dn~k\- Therefore, the assertion holds provided u'^^ does not contain the word 
Sn-iSniw^'^Gn,!}^^ for somc non-empty proper prefix w of Gn,i- Indeed, if s„_is„(?x;~^G„,i)~^ -< u*-^-*, 
then Sn-iDn-iw — Dn-kGn.iDn-iw is a factor of u^'^\ where w -<p Gn,i- But this situation is absurd 
(by Proposition 16 . 1 0)1 since this word contains a singular n-word of the first kind. 

(ii) From Lemma 16.51 we can argue (as in the proof of Theorem 16. ip that it^^^ begins in an Sn+i-i 
block that is followed by an s„ block in the n-partition, or contains an s„+i-i block that is followed 
by an s„ block. However, in the first case, we see that u would contain a singular word from the set 
since u would contain s„+i_iS„, which has Sn+i-iSn+2-i as a prefix (see Proposition 16. 6p . This 
contradicts Proposition 16.101 and so only the second case can occur. By reasoning as above and using 
the fact that \u\ = |sj^s^"_x ■ • • s^!^^^_r/s„+i_i|, u^^^ must start in the left-most s„ block in the word 

r dn dn+3—i 
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which appears in the n-partition of s. Since Dn+i-i -<p Sn+2-i s„, u^^^ ends within the first |_D„+i_i| 
letters of the s„ block to the right of the Sn+i^i block. Otherwise, u would contain a singular word 
u) G r2jj+2-i which contradicts Proposition 16.101 To the left of the Sn+i-i block, there is the word 
st"^^s'^"_i ■ ■ ■ s'^"^2~k^n+i~k and, in view of the fact that |u| = \SnSn"_i ■ ■ ■ s'^":^2-i^n+i-i\ with 1 < r < 
dn+i, one deduces that there exists a p G N with < p < \Dn+i~i\ such that u^^^ starts at position p in 
Sn^t-i ■ ■ ■ SnX2-i'^n+i-iSn- This implies ^ v'^Dn+i-i where v := s^sjjli • ■ ■ s'^'^^^- Sn+i^i- It remains 
to show that v'^Dn+i-i is contained in 

which, in turn, is a factor of s. Indeed, this fact is easily deduced from Lemma 16.131 

(iii) The proof of this assertion is similar to that of (ii), but with i = k and 1 < r < dn+i — 1. The 
details are left to the reader. □ 

Lemma 6.15. For all n, r G N"*" and i 6 [2, k], the word v := s^iSj^'lj^ • • ■ s'^"j^2~i^n+i~i is primitive. 

Proof. Suppose on the contrary that the given word v is not primitive, i.e., suppose v — for some 
non-empty word u and integer p > 2. Then \v\aj = p\u\aj for each letter aj e Ak, i.e., p divides \v\aj for 
each aj G Ak. In particular, p divides 

kUi = f (Qn - Pn) + dniQn-1 " Pn-l) H h {Qn+l-i - Pn+l-i) 

= \v\ - {rPn + dnPn-1 H h (i,i+3-i^'n+2-i + Pn+l-i), 

by Proposition 13.21 Thus, p must also divide rP„ + (i„P„_i + • ■ • + rf,i+3-i-Pri+2-i + Pn+i-i- But 
gcd(|u|, rP„ + dnPn-i + • ■ • + dn+'i-iPn+2-i + Pn+l-i) = 1, which yields a contradiction; whence p = 1, 
and therefore v is primitive. □ 

Proof of Theorem \6.12\ We simply prove (|6.9p and (|6.1ip as the elements of the respective sets are mu- 
tually distinct (by Lemma [6.15[) . which imphes (|6.8p and (|6.10p . 

(i) As we have already seen, for each i e [2, fc], v := Sn"'^^s^'li • • • s„+^tr/s„+i_iS„G~^_;^ = 
Sn"^^^'^Dn-k is a factor of s. Thus, by Lemma [6.14r i). it suffices to find all squares of words u with 
\u\ — r\sn\ (1 < J' < dn+i) that occur in the word v. In fact, one need only consider occurrences of 
starting in the left-most s„ block of v, and the result easily follows. 

(u) Let i G [2, fc — 1]. By Lemma [5. 131 the word (sn""^^s^'Li • ■ ■ Sj^"^2-i ^n+i-i)'^ is a factor of s and it 
contains the word 

V := (s„S„'li • ■ • S„+2-» Sn+l-t) Dn+l-i, 

for any r with 1 < r < dn+i. Therefore, is a factor of s for each word u given by 
u ■■= Cj«4-i • • • 4+^2-/ < j < \Dn+i^,\. 

Conversely, if ^ s with |u| — \s^s'^"_-^ ■ ■ ■ sf^^^-i ^n+i-i\ for some 1 < r < d„+i, then -< v, by 
Lemma [6.14( 11). And, since \v\ — 2\u\ + \Dn+i-i\, we must have u — Cj{s^s'^"_i • • ■ s.^"^2-i ^n+i-i) for 
some j with < j < \Dn+i-i\. 

The case i = fc is proved similarly, using Lemma |6 . 14f iii) . □ 

6.2 Cubes and higher powers 

Our subsequent analysis of cubes and higher powers occurring in s is now an easy task due to the above 
consideration of squares. Extending Theorem 16. 121 fsee Theorem 16. 1 71 below) . only requires the following 
lemma, together with arguments used in the proof of Theorem 16.121 

Lemma 6.16. Let n G and suppose u'^ ^ s with |s„| < |u| < |s„+i|. Then does not contain a 
singular word from the set for any i € [1, fc — 1]. 



22 



Proof. Suppose on the contrary that ~ u^^"^ u^"^^ u^^^ contains a singular word w G for some 

z S [l,fc — 1]. By Proposition 16.101 w is not a factor of u^'^^ and therefore every occurrence of w must 
begin in u^^^ or u'-^\ both of which arc foUowed by u again. Accordingly, there exists a p S N such that 
w starts at position p in both u^^^ and u''^\ Reasoning, as in the proofs of Propositions 16.91 and 16.101 
yields the contradiction \u\ > □ 

Theorem 6.17. Let n, r, I e N+, I > 3. 

(i) Forl<r<d„+i, 

(\sn\ 1/ 1 < r < (d„+i + 2)//, 

P(l<l;0= < |A^-fc| + l */r = K,+i+2)/;, (6.14) 

[o lf{dn+l+2)/l<r<dn+l. 

In particular, 

f{C,(<J : 0<j<|s„|-l} z/1 <r < (d„+i + 2)//, 

I : 0<j< z/r= K+i + 2)/;, (6.15) 

(ii) For 1 < r < dn+i and i G [2, k] (with r ^ dn+i if i ~ k), we have 

P(|Sn4-l • • • 4^2-./ = 0- (6-16) 

Proof, (i) Suppose u < s with |u| = r|s„| for some 1 < r < c?„+i, and consider an occurrence of 
y} = ^(1)^(2) . . . u(0 in s, ^ > 3. By Lemma [6.14^ 1). u'^^^u'^^) begins in an s„ block of the n-partition of 
s, and by Lemma I6.16i v} does not contain a singular n-word of the first kind. So, as in the proof of 
Theorem I6.12f i). one infers that v} is contained in the word v := s^T*^'^'^ Dn-k < s, and the rest of the 
proof follows in much the same fashion. 

(ii) In this case, assume — u'^^u^^^u^'^) occurs in s. By (ii) and (iii) of Lemma [6.141 u^^^ begins 
in an s„ block and contains an Sn+i-i block that is followed by an s„ block in the n-partition of s. 
Accordingly, u"^ contains the word s„+i_iS„, and hence contains a singular word w G which 
contradicts Lemma 16.161 □ 

6.3 Examples 

Example 6.1. Let us demonstrate Theorems 16.121 and 16.171 with an explicit example. Consider the 
standard episturmian word s over ^3 ~ {oi, 02, as} = {a, 6, c} with periodic directive word [ahcca)^ . We 
have {dn)n>i — (1, 1, 2, 2, 1, 2), and hence 

si = ab, |si| = 2; 

52 = abac, \s2 \ — 4; 

53 ~ abacabacaba, {s^l — 11; 

54 = abacabacabaabacabacabaabacabacab, |s4| — 32; 

55 = abacabacabaabacabacabaabacabacababacabacabaabacabacabaabac, {s^l = 58. 

Also, Do = s, Di = a, D2 ~ abacaba, and = abacabacabaabacabacaba. 

We shall simply consider squares and cubes of words of length m occurring in s with |s3| < m < |sg|. 
By Theorem 16. 11 we need only consider lengths m in the set 



S {ksl, 2|S3|, |S4|, |S5|,2|S5|, IS3S2I, \sl^2\, \s3slsi\, IS4S3I, IS5S4I, 15^541, IS5S4S3I}. 
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According to Theorem 16. 12f i). for 3 < n < 5, we have 

p{\s3\;2) = |s3| = 11, 
p(2|s3|;2) = |i?o| + l = l, 
p(|s4|;2) = |s4| =32, 
p(|s5|;2) = |S5| = 58, 
p(2|s5|;2) = |i?2| + l = 8. 

Also, part (ii) of Theorem 16. 121 gives 



P{ S3S2|;2) = 


1^2 1 + 


1 


= 8, 


Pi 4s2\;2) = 


1^2! + 


1 


= 8, 


P( S3slsi\;2) 




+ 


1 = 2, 


p{ S4S3|;2) = 


\D3\ + 


1 


= 23, 


P{ S5S4|;2) = 


\D,\ + 


1 


= 34, 


Pi S5S4|;2) = 


\Di\ + 


1 


= 34, 


p{ S5S4S3I; 2) 


= m 


+ 


1 = 23 



Furthermore, from Theorem 16. 17i one has 

P(|S3|;3) = |S3| = 11, 

P(2|S3|;3) = 0, 
p(|s4|;3) = |i?i| + l = 2, 
P(|S5|;3) = |S5| = 58, 
P(2|s5|;3) = 0, 

and p(to; 3) = for aU other lengths m £ S. 

For instance, the sole factor of s of length 2|s3| = 22 that has a square in s is 

S3 = abacabacabaabacabacaba, 

and the eight squares of length 2|s3S2| = 30 are the squares of the first eight conjugates of S3S2 = 
abacabacabaabac; namely 

(abacabacabaabac)^ , (bacabacabaabaca)'^ , ... , [cabaabacabacabaf' . 

The only factors of length |s4| = 32 that have a cube in s are the first two conjugates of S4, i.e., 

s\<s and (Ci(s4))^ = [a^^s^af' -< s. 

□ 

Example 6.2. The k-bonacci word is the standard episturmian word ry^, G with directive word 
(aia2 • • -Ofc)". Since all di — 1, we have s„ = s„_is„_2 • • ■ Sn-k for all n > 1 (and the lengths \sn\ are 
the k-bonacci numbers). Thus, for fixed n G and I >2, \i < r/^, with |s„| < \w\ < \sn+i\, then 
we necessarily have \w\ — |s„| + |s„_i| + • • • + |s„+i_i| for some i G [1,A: — 1] (by Theorem 16. ip . The 
preceding main theorems reveal that 

7'(l;2) = {ai}, T'ds,,!; 2) = C(s„) = 17° and P(|s„|; 3) = {Cj(s„) : < j < |i?„_fcl}. 

Furthermore, for each i G [2, fc — 1], we have 

VilSnSn-l ■ • ■ S„+l-i|; 2) = {Cj{SnSn^l ' ' ' S„+i_i) : < j < \Dn+l-i\}. 

All other Pdw]; /) = 0, / > 2. In particular, fc-bonacci words are 4-power free. □ 
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7 Concluding remarks 



Using the results of Section 6, it is possible to determine the exact number of distinct squares in each 
building block s„, which extends Fraenkel and Simpson's result [10] concerning squares in the finite 
Fibonacci words. Such work forms part of the present author's PhD thesis [13[ Chapters 6 and 7]. 

Theorems 16 . 11 [ [6. 121 and 16 . 1 71 also suffice to describe all integer powers occurring in any (episturmian) 
word t e that is equivalent to s. (See [HI Theorem 3.10] for a definition of such t.) The problem of 
determining all integer powers occurring in general standard episturmian words (with not all di necessarily 
positive) remains open. 
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